Black Box Features for the WMT 2012 Quality Estimation Shared Task
نویسنده
چکیده
In this paper we introduce a number of new features for quality estimation in machine translation that were developed for the WMT 2012 quality estimation shared task. We find that very simple features such as indicators of certain characters are able to outperform complex features that aim to model the connection between two languages.
منابع مشابه
UAlacant word-level machine translation quality estimation system at WMT 2015
This paper describes the Universitat d’Alacant submissions (labelled as UAlacant) for the machine translation quality estimation (MTQE) shared task in WMT 2015, where we participated in the wordlevel MTQE sub-task. The method we used to produce our submissions uses external sources of bilingual information as a black box to spot sub-segment correspondences between a source segment S and the tra...
متن کاملThe UPC Submission to the WMT 2012 Shared Task on Quality Estimation
In this paper, we describe the UPC system that participated in the WMT 2012 shared task on Quality Estimation for Machine Translation. Based on the empirical evidence that fluencyrelated features have a very high correlation with post-editing effort, we present a set of features for the assessment of quality estimation for machine translation designed around different kinds of n-gram language m...
متن کاملUAlacant word-level and phrase-level machine translation quality estimation systems at WMT 2016
This paper describes the Universitat d’Alacant submissions (labeled as UAlacant) to the machine translation quality estimation (MTQE) shared task at WMT 2016, where we have participated in the word-level and phrase-level MTQE subtasks. Our systems use external sources of bilingual information as a black box to spot sub-segment correspondences between the source segment and the translation hypot...
متن کاملRanking Translations using Error Analysis and Quality Estimation
We describe TerrorCat, a submission to this year’s metrics shared task. It is a machine learning-based metric that is trained on manual ranking data from WMT shared tasks 2008–2012. Input features are generated by applying automatic translation error analysis to the translation hypotheses and calculating the error category frequency differences. We additionally experiment with adding quality es...
متن کاملMT Quality Estimation: The CMU System for WMT'13
In this paper we present our entry to the WMT’13 shared task: Quality Estimation (QE) for machine translation (MT). We participated in the 1.1, 1.2 and 1.3 sub-tasks with our QE system trained on features from diverse information sources like MT decoder features, n-best lists, monoand bi-lingual corpora and giza training models. Our system shows competitive results in the workshop shared task.
متن کامل